
    Density Representations for Words and Hierarchical Data

    We demonstrate the benefits of probabilistic representations due to their expressiveness, which allows for flexible representations, their ability to capture uncertainty, and their interpretable geometric structures that are suitable for modeling hierarchical data. We show that multimodal densities can be effectively used to represent words in natural text, capturing possibly multiple meanings and their nuances. Probability densities also have natural geometric structures which can be used to represent hierarchies among entities through the concept of encapsulation; that is, dispersed distributions are generic entities that encompass more specific ones. We show an effective approach to train such density embeddings by penalizing order violations, which are defined through asymmetric divergences of probability densities.
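    To make the training idea concrete, below is a minimal sketch, not the paper's implementation: it uses the KL divergence between diagonal Gaussians as the asymmetric divergence and a thresholded penalty as the order violation. The threshold gamma, the margin, and the dimensions are assumed hyperparameters chosen only for illustration.

    import numpy as np

    def kl_diag_gaussians(mu_p, var_p, mu_q, var_q):
        # KL(p || q) for diagonal Gaussians; asymmetric, so the direction
        # of the divergence can encode which density encapsulates which.
        return 0.5 * np.sum(np.log(var_q / var_p)
                            + (var_p + (mu_p - mu_q) ** 2) / var_q - 1.0)

    def order_violation(child, parent, gamma=1.0):
        # Penalty is zero once the divergence from the (specific) child
        # density to the (generic) parent density falls below gamma.
        mu_c, var_c = child
        mu_p, var_p = parent
        return max(0.0, kl_diag_gaussians(mu_c, var_c, mu_p, var_p) - gamma)

    def pair_loss(pos, neg, margin=5.0):
        # Max-margin signal: penalize violations on a true hierarchy pair,
        # and reward violations on a corrupted (negative) pair.
        return order_violation(*pos) + max(0.0, margin - order_violation(*neg))

    # Example: a dispersed "animal" density should tend to encapsulate a
    # more peaked "dog" density (entity names are purely illustrative).
    animal = (np.zeros(50), np.full(50, 2.0))
    dog = (np.zeros(50) + 0.1, np.full(50, 0.5))
    print(order_violation(dog, animal))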

    Probabilistic FastText for Multi-Sense Word Embeddings

    We introduce Probabilistic FastText, a new model for word embeddings that can capture multiple word senses, sub-word structure, and uncertainty information. In particular, we represent each word with a Gaussian mixture density, where the mean of a mixture component is given by the sum of n-grams. This representation allows the model to share statistical strength across sub-word structures (e.g. Latin roots), producing accurate representations of rare, misspelt, or even unseen words. Moreover, each component of the mixture can capture a different word sense. Probabilistic FastText outperforms both FastText, which has no probabilistic model, and dictionary-level probabilistic embeddings, which do not incorporate subword structures, on several word-similarity benchmarks, including English RareWord and foreign-language datasets. We also achieve state-of-the-art performance on benchmarks that measure the ability to discern different meanings. Thus, the proposed model is the first to achieve multi-sense representations while having enriched semantics on rare words. (Comment: Published at ACL 2018)
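    To illustrate the subword mechanism, the sketch below shows how one mixture component's mean can be assembled from hashed character n-gram vectors, plus the log expected likelihood kernel between two spherical Gaussian components as a similarity. The bucket count, n-gram range, dimension, and shared variance are illustrative assumptions, and the full model's word-level vectors and multiple senses per word are omitted here.

    import zlib
    import numpy as np

    def char_ngrams(word, n_min=3, n_max=6):
        # FastText-style subword units with boundary markers.
        w = f"<{word}>"
        return [w[i:i + n] for n in range(n_min, n_max + 1)
                for i in range(len(w) - n + 1)]

    rng = np.random.default_rng(0)
    NUM_BUCKETS, DIM = 2 ** 16, 50  # illustrative sizes
    ngram_table = rng.normal(scale=0.1, size=(NUM_BUCKETS, DIM))

    def component_mean(word):
        # Mean of one mixture component: the sum of hashed n-gram vectors,
        # so rare, misspelt, or unseen words still get a representation.
        idx = [zlib.crc32(g.encode()) % NUM_BUCKETS for g in char_ngrams(word)]
        return ngram_table[idx].sum(axis=0)

    def log_elk(mu_a, mu_b, var=1.0):
        # Log expected likelihood kernel between spherical Gaussians
        # N(mu_a, var*I) and N(mu_b, var*I): the integral of their product
        # has the closed form N(mu_a; mu_b, 2*var*I).
        d = mu_a.size
        diff = mu_a - mu_b
        return -0.5 * (d * np.log(2 * np.pi * 2 * var) + diff @ diff / (2 * var))

    # Shared subword structure gives related or misspelt forms similar means.
    print(log_elk(component_mean("rock"), component_mean("rocks")))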
